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Abstract 



A central result in extremal set theory is the celebrated theorem of Sperner from 1928, which 
gives the size of the largest family of subsets of [n] not containing a 2-chain F\ C F2. Erdos 
extended this theorem to determine the largest family without a fc-chain Ft C F% C . . . C F/. . 
Erdos and Katona, followed by Kleitman, asked how many chains must appear in families with 
sizes larger than the corresponding extremal bounds. 

In 1966, Kleitman resolved this question for 2-chains, showing that the number of such chains 
is minimized by taking sets as close to the middle level as possible. Moreover, he conjectured the 
extremal families were the same for fc-chains, for all k. In this paper, making the first progress on 
pH this problem, we verify Kleitman's conjecture for the families whose size is at most the size of the 

, k + 1 middle levels. We also characterize all extremal configurations. 

1 Introduction 

m 

Sperner's Theorem is a central result in extremal set theory, giving the size of the largest family of 
sets not containing a 2-chain F± C F2. Erdos later extended this theorem to determine the largest 



family without a /c-chain F\ C F2 C . . . C F^. A natural question is to ask how many fc-chains must 
appear in a family larger than this extremal bound. 

More precisely, we consider the following problem, first posed by Erdos and Katona and then 
extended by Kleitman some fifty years ago. Given a family T of s subsets of [re], how many fc-chains 
must J- contain? We denote this minimum by c&(re, s), and determine it for a wide range of values of 
s. This provides a quantitative strengthening of the Erdos result on the size of fc-chain- free families. 
^ , We shall now discuss the background of Sperner's Theorem and this problem further, before 

I presenting our new results. 

1.1 Background 

Extremal set theory is one of the most rapidly developing areas in combinatorics, having applications 
to other branches of mathematics and computer science including discrete geometry, functional anal- 
ysis, number theory and complexity. The typical extremal problem has the following form: how large 
can a structure be without containing some forbidden configuration? A classical example, considered 
by many to be the starting point of extremal set theory, is a theorem of Sperner p3]. An antichain 
is a family of subsets of [re] that does not contain sets F\ C F2. Sperner's Theorem states that the 
largest antichain has (^ n " 2 j) se ^ s i a bound that is easily seen to be tight by considering the family of 
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sets of size [^J . This celebrated result enjoys numerous applications and has many extensions, many 
of which are discussed in Engel's book [3]. One particular extension, due to Erdos [3], shows that 

the size of the largest set family without a k-chain, that is, /c-sets F\ C F2 C . . . C F^, is the sum of 

r n+k-2 1 

the k — 1 largest binomial coefficients, M^-i = X/_rn-*H-2i (")• When k = 2, we recover Sperner's 

l ~\ 2 I 

Theorem. 

Our problem is what we refer to as an Erdos-Rademacher-type extension of Erdos' theorem, a 
name we now explain. Arguably the most well-known result in extremal combinatorics is a theorem 
of Mantel [10] from 1907, which states that an w-vertex triangle-free graph can have at most 



edges. In an unpublished result, Rademacher strengthened this theorem by showing that any graph 
with ^ +1 edges must contain at least I Sj triangles. Erdos [5] then extended this to graphs with 
a linear number of extra edges, and in [6] studied the problem for larger cliques. More generally, for 
any extremal problem, the corresponding Erdos-Rademacher problem asks how many copies of the 
forbidden configuration must appear in a structure larger than the extremal bound. 

In the context of Sperner's Theorem, this problem was first considered by Erdos and Katona, 
who conjectured that a family with (i n y 2 j) + t se ^ s m ust contain at least t f" 2 ^-] 2-chains. Kleitman 
[9] confirmed the conjecture, and, in a far-reaching generalization, showed the minimum number of 
2-chains in a family of any fixed size is obtained by choosing sets of size as close to ^ as possible. He 
then conjectured (see 0(9]) that the same families minimize the number of /c-chains, a problem that 
has remained open for nearly fifty years. 

Conjecture 1.1. The number of k- chains in a family is minimized by choosing sets of sizes as close 
to ^ as possible. 

1.2 Our results 

In this paper we study these Erdos-Rademacher-type extensions of the theorems of Sperner and Erdos. 
We began by considering the case of 2-chains, and determined the minimum number of 2-chains in 
a family of any number of sets. Later, we discovered Kleitman had earlier obtained the same result. 
However, through slightly more careful calculations, and by introducing an additional argument, we 
are able to characterize all extremal families, as given below. 

Theorem 1.2. Let J 7 be a family of subsets of [n], with \F\ = s > (i n y 2 j) • Let r S be the unique 

— -\-r— 1 — 4-r 

half-integer such that \\, z _n (?) < s < \\ 2 _n_ ( n ). Then F minimizes the number of 2-chains 
if and only if the following conditions are satisfied: 

1. For every F G F, | - r < \F\ < § + r. 

2. For any A C [n] with ^ — r + 1 < \A\ < | + r — 1, we have A € T . 

— +r— 1 

3- If s < ^2^_n_ r (™), then {F £ F : \F\ = ^ ± r} forms an antichain. 

4- If s > ^2i=R-r (1) > th en {F ^ F : \F\ = 7j ± r} forms an antichain. 

Our main results verify Conjecture 11.11 for families of certain sizes. To begin with, recall that 
Erdos showed the largest family without fc-chains consists of the k — 1 middle levels of the hypercube, 
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whose size we denote by M^-i- If we were to add one set to this family, the best we could do would 
be to add it to the kih level, in which case we would create ( t^V )(k — 1)! fc-chains. Indeed, 
we show that every additional set must contribute at least this many new fc-chains, and the above 
construction shows this is tight when our extremal family is contained within the k middle levels. 

Theorem 1.3. If J- is a set family over [n] of size s = M^_i + t, then F contains at least 
t( [{n ^\ /2i )(k-l)\ k-chams. 

We are then able to extend our argument to work for larger set families, obtaining a result that 
is tight when the extremal family is contained within the k + 1 middle levels. 

Theorem 1.4. Provided n > 15 and k < n — 6, if J 7 is a set family over [n] of size s = + 1, then 
the number of k- chains in F is at least 

Go.-™) ( l< vry- M^t-V ) + ( r(n v ,/21 ) ©) <*- 

In both cases, we actually obtain stronger results (see Theorems 13.11 and 14.21 respectively), pro- 
viding stability versions of the above theorems, showing that if a family has close to the minimum 
number of /c-chains, it must be close in structure to the extremal example. These stability results 
are of interest even in the case k = 2, as one does not obtain any stability from the Kleitman proof 
for 2-chains. We then use the stability results to show that when the above bounds are tight, the 
extremal families are exactly as in Theorem 11.21 

1.3 Outline and notation 

The remainder of this paper is organized as follows. Section [2] contains a proof of Theorem 11.21 In 
Section [3j we prove Theorem 11.31 and then in Section |4] prove Theorem ll.4[ In the final section we 
present some concluding remarks and open problems. Appendix [A] contains the proof of a technical 
proposition needed for Theorem 11.21 

We let [n] denote the set of the first n integers. For a ground set X and an integer i, we denote the 
family of i-subsets of X by (f ) = {Y C X : \Y\ = »}. We let M k = E^n-k+m (?) be the size of the 
k middle, and thus largest, levels. Given a family F of subsets of [n], we let F{ = Fn (^) denote those 
sets in T of size i. The ^-shadow of a family is given by d i J- = {G : 3F G J 7 , G C F, \ G\ = \F\ — £}. 
For a subset F C [n], we define m(F) = max{\F\ ,n — \F\}. 

Given a set family F, Ck{F) denotes the number of fc-chains in F . For any n G N and < s < 2 n , 
we let Cfc(n, s) denote the minimum of Ck{F) over all families F of s subsets of [n]. When k = 2, if 
we have two families F and Q, then we let C2(F, Q) denote the number of 2-chains with one set from 
F and one set from Q. 

2 Counting 2-chains 

In this section we will prove Theorem 11.21 characterizing those families that minimize the number of 
2-chains. We essentially show that it is optimal to take sets of sizes as close to ^ as possible. The 
theorem then prescribes how the boundary sets can be distributed. 
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Sperner's Theorem shows that the largest antichain is given by one of the middle levels, that is 
either all sets of size |_^J or all sets of size |~^] . Obviously, an antichain minimizes the number of 
2-chains, as it has none. This theorem is then a natural extension of Sperner's Theorem, as it shows 
that to construct a family of any size that minimizes the number of 2-chains, one should start by 
taking sets of size [^J , then sets of size |_^J + 1, then |_^J — 1, and so on until one has a family of 
the desired size. As we shall show, these families are optimal, and so we may denote the number of 
2-chains in the first s such sets by C2(n, s). 

The idea behind the proof is as follows. If our family F contains a set F that is too far away 
from the middle (i.e. ||-F| — ^| > r), then we will show that we can shift F closer to the middle and 
decrease the number of 2-chains. Once we have our family contained in the 2r + 1 middle layers, a 
simple counting argument will give the characterization of extremal families. As the shifting process 
is essentially the same as in Kleitman's proof in [9], we relegate the proof of the following proposition 
to Appendix lAl 

Proposition 2.1. Let F be a family of s > (uw^l) su ^ se ^ s °f [ n ] minimizing the number of 2-chains. 
If A G F is of maximal cardinality, with \ A\ = ^ + m, then for any B C A, \B\ > § — m + 1, we have 

Be J. 

Assuming this proposition, we shall proceed to prove Theorem ll.2i 
Proof of Theorem 1 1.21 We prove the theorem by induction on s. 

For the base case, we take s = (i n / 2 j)- By Sperner's Theorem, it follows that any family F of 
this size that minimizes the number of 2-chains must be an antichain. It is well known that the only 
antichains of this size are the family of all sets of size [^J , or the family of sets of size . It is 
easy to see that these families are the only ones satisfying Properties 1 through 4, with r = or ^ 
depending on whether n is even or odd respectively. 

For the induction step, assume s > (^ n / 2 j); an d let F be an optimal family of size s. Suppose 
Property 1 were not satisfied. Since F and F' = {[n] \ F : F G F} have the same number of 
2-chains, we may assume there is a largest set F G F with |F| = ^ + t for some t > r. By 
Proposition 12. H it follows that for every G C F, \G\ > § — t + 1, we have G G F. Hence F 
is in at least 2jj=i ( 2 j ) 2-chains in F. Since F \ {F} is a family of s — 1 sets, there are at 
least C2(n,s — 1) 2-chains in F not involving F. Thus the number of 2-chains in F is at least 
C2(n, s — 1) + YlT=i (^ + *) > c 2( n i s — 1) + YliLi — c 2( n i s )) an d so F cannot be optimal, giving 

a contradiction. Hence if F is optimal, each F G F has ^ — r < \F\ < § + and so Property 1 is 
established. 

Now consider the case Yji=^- r +i CD < s - X^ 2 =H_ r (")• Since s > Z^ 2 =§_ r +i (")> and in light of 
Property 1, it follows that there exists some F G F with \F\ = § ± r; by symmetry, we may assume 
\F\ = § +r. By Proposition 12. 11 F must be contained in YlfL^ 1 = c 2(n, s) — C2(n, s—1) 2-chains 

with sets in F of sizes between ^ — r + 1 and ^ + r — 1. If F is contained in any 2-chains with sets 
of size 7£ — r, then by induction it follows that F has more than C2(n,s) 2-chains, contradicting the 
optimality of F. Thus F is incomparable to the other sets in F of sizes ^ ± r. Removing F, we find 
that F \ {F} must also be optimal, and thus Properties 2 and 3 follow. 

^ —-\-r 

Finally, suppose Yli=™- r il) — s — J2i=™- r U) - By Property 1, we know all sets in F have sizes 
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between \ — r and | + r. Let % = U? 2 r f'™l) be the family of all subsets of [n] of sizes between 
§ — r and § + r, and let £/ = % \ J 7 be those sets not in T . We have 02(F) = C2^K) — C2(G, %) + 02(G)- 
02(0) depends only on r, and hence on s, and is independent of the structure of T . We have 



02(0,70 = ?0 
Gee 




The parenthetical term is maximized when |G| = ^ ±r, and so C2(G,'H) is maximized when for every 
G £ Q we have |G| = S ± r. Finally, 02(0) is minimized when is an antichain, in which case 
C 2(G) = 0. Both of these conditions are satisfied by the construction outlined at the beginning of this 
section, and hence must also be true of any other extremal family. Thus to minimize 02(F), T must 
contain all sets of sizes between ^ — r + 1 and ^ + r — 1, and G = (™+ r ) U (2^7.) \ 3~ should be an 
antichain, establishing Properties 2 and 4. This completes the induction step, and with it the proof 
of Theorem 11,21 

□ 



3 Counting A;-chains 

We now seek a similar result for /c-chains, and thus to make some progress on Conjecture II, II In this 
section we verify the conjecture when the number of sets is at most that in the k middle levels, and 
in the next section we shall extend the result to the k + 1 middle levels. 

Note that if we take all M^-i sets in the k—1 middle levels, that is of sizes between |_^^^J + 1 and 
\~?r\ — -*-> an d then add t sets of size |_^-^-^J , we would create precisely t ( ) (k — 1)! /c-chains. 

Hence Theorem 11.31 is tight when < t < (ifo+jUAjj)' We shall in fact prove the following stronger 
theorem, which provides a stability result. In the following notation, we let r £ {^p, | } be such 
that the sets in the k middle levels have sizes between \ — r and § + r, and for a set F, we define 
m(F) = max{|F| ,n - \F\}. 



Theorem 3.1. Let T be a family of subsets of [n]. Then the number of k- chains in T is bounded by 




Proof of Theorem \3.1\ We prove the theorem by induction on \T\. If \T\ = 0, then there is nothing 
to show, as the desired lower bound is negative. 

For the induction step, we begin by noting that for every set F £ T with | \F\ — ^| > r, F can 
be in at most Hu^f?)(& — 1)' ^-chains. If not, then we could remove F, and applying the inductive 
hypothesis to T \ {F}, we would have the desired inequality. 

We now use an LYM-type inequality, counting the number of fc-chains in our family by considering 
permutations. We say that a permutation a £ S n contains a set F C [n], denoted F £ a, if 
{<r(l), <t(2), . . . ,cr(|-F|)} = F; that is, F is an initial segment of a. Note that if a contains k sets 
Fi, F 2 , . . . , Fk, then those k sets must form a /c-chain. For any set F C [n], we let S n [F] = {a £ S n : 



5 



F G a}. Since every permutation containing m sets contributes m — (T) < A: — 1 to the right-hand 
side of the sum below, it follows that 

(k-l)nl>J2\Sn[F]\- Yl \^=lSnlFi\\. (1) 

FeT F 1 CF 2 C...CF k eJ r 

As the second sum is over all /c-chains in our family F, this inequality will allow us to bound the 
number of /c-chains. Note that for any F G F, we have |5 n [F]| = \F\l \ [n] \ F\\, and for a /c-chain 
Fx C F 2 C. . . C F k , \nf =1 S n [Fi\\ = \Fi\\ Ui=l \Fi+l \ Fi\l\[n] \ F k \\ gives the number of permutations 
containing the fc-chain. 

We shall associate every /c-chain in F with either its minimum or maximum set, depending on 
which is further away from the middle level. For F € F with \F\ < ^, let C{F) = {F C ^ C ... C 
F k : \F\ + \F k \ < n}, and if F G F with |F| > §, let C(F) = {F 1 C . . . C F k ^ C F : \Fi\ + \F\ > n}, 
and, for convenience, define C(F) = \C(F)\. Note that we have partitioned the set of fe-chains 
{Fx C F2 C . . . C F k : Fi G F} into the disjoint sets {C(F)} i?g - r . We can thus rewrite inequality (UJ 
as follows: 



(k - l)n\ > ^ 



\Sn[F}\- \rf=lSn[Fi] 
FigF 2 C...CF k eC(F) 



(2) 



F&T 

To bound |njL 1 S' n [i 7 'j]| appropriately, we require that and [n] not be members of our family. 
This is given by the following lemma. 

Lemma 3.2. // \F\ < M n _\, and F minimizes the number of k- chains, then ^ F and [n] ^ F. 

Proof. Suppose we had [n] G F. Since \F\ < 2 n — 2, there must be some 7^ F G' F '. We decrease the 
number of A;-chains in F by replacing [n] with F, since any new /c-chain involving F was a A;-chain 
with [n] before. 

Similarly, if G F, we can replace it with any set [n] 7^ F ^ F. □ 

Note that M n _i = 2 n — 2. Thus, if \F\ > M n _i, then either we have all subsets of [n], or our 
family is missing just one set, in which case (as we explained above) it is best to remove either or 
[n\. In either case, the bound in Theorem 13.11 remains true. 

We now assume 1 < \F\ < n — 1 for all F G F . If we fix F\, \F\\ < n/2, then we maximize 

\nf =1 S n [Fi\\ = |*i|!Ilti l^+i \ F N IN \ b y takin § l^+i = 1 for 1 < * < fe - 1, and so 
jnfL^fFijj < |Fi|! (n - \Fi\ - (k - 1))!. The same holds true if we instead fix F k , \F k \ > n/2, and 
thus \r\^ =1 S n [Fi]\ < (\F k \ - (k - 1))! (n — \F k \)\. We can unify both bounds in the form Zf^^l - 

Moreover, by definition we must have C(F) = for any F with \\F\ — ^| < r — 1. Hence we split 
our sum based on how ||F| — ^| compares to r. Dividing through by n!, inequality ([2]) leads to 



where 



n/2+r-l 



( n \ X) / " ) ' 

F&T \\F\> i=n/2-r+l KiJ 
\\F\-n/2\<r-l 
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1 / t-H-F J 1 _ |Ai/2-r| + |Ai/2+r| ll F |-™/ 2 l= J ' 



w\ ( B £n(*-w (n/2 + j w + ,)(i-r)(^-i)'" 

|F|-n/2|=r 



and 



|F|-n/2|>r+l 

Note that S||i?|-n/2|=r ^C^O = c fc(-H _ X]||F|-n/2|>r+i C(-^)> an d so, substituting in S 2 , we obtain 

n/2+r 



+ k 




C{F) 



Now, since Q (^J = ( fc ^) , it follows that when m(F) > f+r, (» ,) flg>) < (^(fr), 

as ^(n — fc + l)<i|+r — /c + l< m(F) — k + 1. Hence the summand is minimized when C(F) is 
as large as possible, which, by our inductive hypothesis, is — 1)!. Substituting this into the 

inequality above gives 

+ k-i>y l —^-+ x 1 



( n W«/2+na_,\| - (n\ In \ (n/2+r\ 

\n/2+r)\ k-1 )\ r i=n/2-r FG-F: U/2+rA fc-1 / 

||F|-n/2|>r+l 

Rearranging gives the desired bound 

|F|-n/2|>r+l 

□ 

Given Theorem 13.11 it is easy to deduce Theorem 11.31 In fact, we are able to characterize all 
extremal families. 

Proof of Theorem \l.'J[ Suppose we have a family of sets J 7 , with \T\ = M^_i + t. Note that the 
contribution each set F G T makes to the right-hand side above is ( n i 2 X r )(k — 1)! if \ — r < 

\\F\) 

\F\ < ^ + r, and — 1)! otherwise. This contribution increases with \\F\ — ^|, and so to 

minimize the right-hand size we need all sets to satisfy \\F\ — ^ < r. Moreover, since the binomial 
coefficients (") are minimized over n/2 — r < i < n/2 + r when i = n/2±r, any extremal family must 
contain all sets of sizes between ^ — r + 1 and ^ + r — 1, with the remaining sets having size § =t r. 
It is easy to see that such a collection of sets gives Ck (T) > t( n/ k 2 X){k- 1)! above, as required. 

To classify the extremal families, note that we already know we must have all sets of sizes between 
?! — r + 1 and 5 +r — 1. If r = |, this consists of the middle k — 1 levels, giving Mfc_x sets. Hence we 
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have t sets of size ^ ± r. To obtain equality in ([I]), we must have every chain passing through either 
k — 1 or k sets of T . As all the sets in the k — 1 middle levels are in F, the chain can contain at most 
one set from J- of size § ± ?*• From this, we deduce that {F S T : \F\ = ^ ± r} must be an antichain. 

If r = ^rp, we know the middle k — 2 levels are full, and we have ( n /2+r) ^ se ^ s °^ s ^ ze § r ' 
In order to obtain equality in ([I]), we must therefore have every chain pass through at least one set 
in J- of size ^ ± r. Hence {G ^ J- : \G\ = \ ± r} must form an antichain. 

In particular, we note that the extremal families are exactly the same as for Theorem 11.21 □ 

We remark that Theorem 13. II is a stability result for Theorem ll.3l as our bound on Ck{J~) increases 
if we are missing sets with \\F\ — ^| < r — 1, or have sets with ||.F| — ^ | > r + 1. Moreover, the 
stability estimates we obtain can also be tight. For instance, if we replace I sets of size ^ ±r with sets 
of size t| + r + 1, Theorem 13. 11 together with some simple computations, shows that we should gain at 
least ^("^"^K^ ~ I)' extra fc-chains. Moreover, it is easy to check that we gain precisely that many 
/c-chains in the case when our family includes all of the (§ + rj-sets and none of the (§ — r)-sets in 
the shadow of the + r + l)-sets which were added. 

Similarly, if we replace i sets of size ^ + r — 1 with sets of size ^ + r, the theorem shows that 
we must gain at least ^/ 2 ^fr ^( fc-^iO — extra /c-chains. This is tight again, if our family includes 
all the sets of size S + r containing any of the replaced sets. As we remarked earlier, these stability 
results are new even in the case k = 2. 

4 Larger families 

While Theorem 11.31 provides a tight bound on the number of fc-chains appearing in families contained 
within the k middle levels, it underestimates the number of A;-chains appearing in larger families. 
This is because in our calculations we assumed every fc-chain had steps {Fi \ -Fj_i) of size 1, as this 
maximizes the number of permutations containing the /c-chain. However, when we are working with 
the k + 1 middle levels, we also have A;-chains with a larger step of size 2, and so we shall have to 
make our argument more robust in order to handle these chains. 

However, there is one additional difficulty. Recall that the number of permutations containing a 
fc-chain F l C F 2 C . . . C F k is given by |n^ =1 -S n [Fi]| = |iq|! ]Ji=l \ F i+l \ F N ( n ~ \ F k\V- If we fix F k , 
say, then we would hope that for fc-chains involving a larger step, the largest this can be is to have 
one step of size 2, and have all the other steps have size 1, as this is precisely the type of /c-chain that 
appears in our extremal families. Such A;-chains are in 2(|i^| — k)l(n — |-Ffc|)! permutations. 

Unfortunately, a chain with k — 2 steps of size 1 and one step of size \Fk\ — k + 1 (so that |iq | = 1) 
is contained in (|-Ffc| — k + l)!(ra — \Fk\)l permutations, which is larger than the bound we require. 
However, recall that we assign fc-chains to either F\ or Fp., depending on which is further from the 
middle. Thus, unless \Fk\ = n — 1, we would assign the above A;-chain to Fx, and so we would be 
fixing Fx and not F k . Hence the one case we need to avoid is having a A;-chain starting with a set of 
size 1 and ending with a set of size n — 1. 

We shall later provide a separate argument to show that there cannot be any sets of size n — 1 in 
an extremal family, thus bypassing this problem. In the meanwhile, for the purposes of our stability 
results, we shall assume there are no sets of size n — 1. We now require two arguments - one to bound 
the number of fc-chains with larger steps, and one to bound the total number of /c-chains. 
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We introduce the following notation for the remainder of this section. Given n and k, we let 
a = l" 1 ^] , so that the k middle levels are those sets of sizes between a — k and a — 1, and the 
(k + l)st middle level has sets of size a. For a set family F, we let C(F) be the set of /c-chains in F . 
We partition these into two subsets: C\(F) are those /c-chains F\ C F2 C . . . C Fk with l-Fj+i \ Fi\ = 1 
for all 1 < i < k — 1, and 02(F) = C(F) \ C\(F) those /c-chains with a larger step. We let C(F), 
C\(F) and 02(F) denote the number of /c-chains in these subsets respectively. As in Theorem 11.31 
we will again identify a /c-chain with one of its endpoints F\ or F^, depending which is further from 
the middle level, giving the partition {C(F)}p e jr of C(F). These sets will again be partitioned into 
C\(F) and C2(F), depending on whether or not the /c-chains have a step of size at least 2. Finally, 
C(F), C\(F) and 02(F) represent the sizes of the corresponding sets of /c-chains. 

4.1 Counting fc-chains with larger steps 

We begin by showing that large families must contain a number of /c-chains with a step of size at 
least 2. The following proposition also provides some stability, which we shall require to show that 
an extremal family cannot contain any sets of size n — 1 . 

Proposition 4.1. Let F be a set family of size \ F\ = + t\, with 1 < \F\ < n — 2 for all F G F, 

and with at least t2 sets missing from the middle k — 1 levels. Then 

C2(m ( tl + (^) 42 ) (") (J) (*-!),. 

Proof. We prove the statement by induction on t\ + t2 > 0, noting that we must have t2 > 0. The 
base case of t\ + 12 = is trivial, as in this case the right-hand side is non-positive. 

The proof will now run along very similar lines to that of Theorem 11.31 an d we shall just make 
a few changes to count only those chains with a large step. To begin with, when we are counting 
sets and /c-chains in permutations, we only want to consider those /c-chains with a large step. To 
ensure this, we shall not count /c-chains that appear consecutively in some permutation. That is, if 
a G S n contains the sets F\ C F2 C . . . C F s for some s > k + 1, we will count F\ C F% C . . . C 
F^-i C i 7 ) c+1 , but not F\ C F2 C . . . C F^. Thus every /c-chain we consider is bound to have some 
step of size at least 2. If s > k, then the number of such chains is (^) — (s — (k — 1)), and since 
(k - l)s - ((*) — (s — (k — 1))) <k 2 -k, it follows that 

(k 2 ~ k)n\ > (k - 1) E \S n [F}\ - £ \rt=iSn[Fi] 

FeT F 1 d...dF k aC2{T) 

-l)\Sn[F]\- £ \i = lSn[Fi] 

As before, we now seek to maximize the terms |n^ =1 S' n [-Fj] | . Provided we have 1 < \F\ < n — 2 for 
all sets F G F, if we fix one of the endpoints of the chain, the number of permutations it is contained 
in is maximized when we have one step of size 2, and all the other steps of size 1. Thus we can bound 
In^SnlF;]] by 2|Fi|!(n - \Fi\ - k)\ or 2(|F fe | - k)\{n - \F k \)\. Dividing through by n\, we have that 




k A -k> V -r^— I k - 1 



C 2 (F) 

FeF \\F\) \ \ k )2, 
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Now, by definition, we must have C%(F) = for all sets F in the k middle levels; that is, with 
a — k < \F\ < a — 1. Let F = {F £ F : \F\ < a — k — 1 or \F\ > a} be those sets outside the middle 
k levels. Thus 



fro(F)\ M 



i=o-fc Fefi^ F \ J \ { k')"i 

We may assume that for every set Fe j, C2{F) < (?) (^l (A; — 1)! = (k — 1) (?) ^> since otherwise 
we may remove F from F and are then done by induction. Hence, since m(F) > a for all F S F, the 
parenthetical term in the second sum is always non-negative, and so the right-hand side is minimized 
by replacing (^) by ("), giving 

e- k > v ^-^i i y 1 (k i C2(F) ) 

t=tt-A V*/ i? e jr W \ \ k I 2 / 

C2(F)\ 



> E t + E w (* - 1 

i=a-k Vi^ p g j- Va/ \ 

E 



2 

(fc- 1)|^| (fc-l)|J-| C 2 (.F) 

/n\ + m\ / n \ /a\ fc! ' 

a-fc U/ U/ \k) 2 



and so 



g^F) ^ (fc- 1)|^| _ 2 (A:-1)|.F| 

W \k) 2 i=a-k \i) \aJ 

If the middle k levels were full, then the first sum would equal k 2 — k. Since we must have at least 
t2 sets missing from the middle k — 1 levels, the right-hand side is minimized when there are exactly 
t<i sets missing, all of size a — 1. In this case, \F\ = t\ + 1^, giving 

Cm ^ « + _ /I 1 \ / + ^ * - 1 



ooi - o lo («-,); v v « j j o' 

As a = \^^~\ , we have 2a — n — 1 > k — 1, and so multiplying through by gives the 

desired bound. □ 



4.2 Counting all £>chains 

As Proposition 14.11 offers us some control over the number of chains with large steps, we can now 
proceed to bound the total number of fc-chains in F. Again, our result provides somes stability, as 
we shall require to forbid sets of size n — 1. 

Theorem 4.2. Let F be a set family of size \F\ = + 1\, with 1 < \F\ < n — 2 for all F £ F, and 
with at least 1% sets missing from the middle k — 1 levels. Then 

™ *(.:») (vr) <* - ■> ♦ (* + (^) *) (G - - (:) ©) <* - * 
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Proof. We prove the theorem by induction on t\ + t<i > 0, and again must have t2 > 0- The base 
case of t\ + 12 = follows from Theorem 11.31 as when \F\ = = M^_i + ( a " fe ), the right-hand side 
above is less than the lower bound for t = ( in Theorem 11.31 

We may now assume that any set F not in the middle k — 1 levels is contained in at most 
((fc-i) + it) (2)) (^ ~ 1)' ^-chains. If not, then we may remove F from F, thus decreasing t\ by 1. 
Applying the inductive hypothesis to F \ {F} and adding the /c-chains involving F then gives the 
requisite number of /c-chains. 

We once again seek to bound the number of /c-chains in our family by counting sets and /c-chains 
in permutations, except this time we shall consider all /c-chains appearing in the permutations. A 
permutation with s sets gives rise to /c-chains, and since for all s we have ks — (^) < /c 2 — 1, it 
follows that 

(k"-l)n\>k^\Sn[F]\- | n ti^[*i]| (3) 

FeF Fic...cF fc eC(J^ 

= J2\k\Sn[F}\- £ In^iW 

F&F \ FiC...CF fc eC(F) 

To maximize |n^ =1 S' n [i 7 i]|, since we have no sets of size n — 1, we should again take all the gaps 
to be as small as possible. Those chains in C\(F) all have steps of size 1, while those in 62(F) should 
have one step of size 2, and the rest of size 1. Dividing by n\ gives 



d(F) C 2 (F) 



FeF M 



By definition, if a — k + 1 < |F| < a — 1, we must have C\(F) = 02(F) = 0, and if |-F| = a — Zc, 
then 02(F) = 0. Thus we have three types of /c-chains to consider: those in C\(F) for \F\ = a — k, 
those in d(F) for F G J" = {F € J 7 : |F| < a - jfe - 1 or |F| > a}, and those in C 2 (F) for F <E F. 
Splitting our sums thus, we obtain 



a-l 



k 2 -i> V 



k\F\ Z\F\=a-k C l( F ) L 1 A Ci(F) 



+ 



i=a—k 



F&T 



^ ( " A /m(F)\ fc! 



Since 



53 C 1 (F) = C(F)-53C 1 (F)-^C 2 (F), 

\F\=a-k F( zjr Fe jr 

we can substitute this expression into the second sum, and redistribute, to obtain 



> 2^ Twr-C* -i) + Si + E 2 , 



L- fc )(Vf)(^-D'-^ 
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where 

Ci(F) d(F) 



The following lemmas, whose proofs we defer to the end of this subsection, allow us to bound 
these sums. 

Lemma 4.3. For every F £ T , we have 

1 C X {F) G^) 



(A) (£|)ffi?)(* " (a-J( n ^f )(* " 1)! " L- k )( n k-l k ) ' 

Lemma 4.4. For every F £ J 7 , we have 

1 4-i-iSl)>±L, ft < F A 



We now replace our summands with these lower bounds, obtaining 
> k \Fj\ _ 2 _ (fc-i) 

\a-k)\ fe-1 A ft ^ i=a-k \iJ p e jr \a-k) \ fc-1 / 



fej^j n . 2 ^ i ij-i ( (fe-i) i fc- A 

i=a-k \i' \\a-k)\ k-l I \aJ / 



We can use Proposition 14,11 to lower bound (^(J 7 ). Moreover, as a > n — a + k, it follows that 
each set in J 7 , whose size is not a, has greater weight than any set in the middle k levels. Thus, the 
right-hand side is minimized when we fill the middle k — 1 levels as much as possible. If we were to 
have the full k middle levels, the first sum would be equal to k 2 . However, as we must have at least 
t2 sets missing from the middle k — 1 levels, it is best to have exactly t% sets of size a — 1 missing, 
resulting in 

C{T) kt 2 ( (^J k-l\ 



Co-*) ( Vt*) V ( Vt fc ) (3 7 

, (*! + (¥) *2)(g)@ (fc-1)! A 2 \ 

L"J(Vf)(fc-i): V n-o + i; 

= 1 + Aiti +A 2 t 2 , 
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where, after simplifying the binomial expressions, we find 

A (£) , (3@ , 2(g)® 



( n \ /n—a+k\ 
\a-k) \ fc-1 7 



-, and 



4 (*-i) , MM i k ~ 1 k 2(*"l)(2)ffi 



k-lf + (2) (J) , 2a-n-A J + (f 

! /r>\ I — 



Multiplying through by ( "jj ( n ^"^ fc )(A; — 1)! gives the desired bound. □ 
To complete the proof, we now prove the two lemmas. 

Proof of Lemma^ Note that (^Cg?) = i) [ m ?F) k -k+i) > and ' b y the same token > ( a -J("-!l : 
(fc-lKT-fc 1 )- Since a - fc = LNr^J' and ' as F G J^, we have m{F) > a, it follows that 
(\F\) ("k-i) ^ (o-fc) ( n fc-t fc )' Thus the left-hand side of the inequality is minimized when we choose 
C\(F) as large as possible. By definition, C\(F) < ("t^i)(k — 1)!. Making this substitution gives 

1 C X {F) Cl (F) gg) 



□ 

Proof of Lemma \4-4\ If m(F) = a, then we have equality, so we may assume m(F) > a + 1. By 
induction, we can assume that no set is in more than ^ Ca;— l) CD©) ~~ 1)' ^-chains, giving a 
bound on C^-F). Thus 

* ( G - + G) G)) C t 1 ) G) »- ^ »- KT) f 

and so the factor — 1 — ^pf^ k , ~\ is non-negative. As (™) > (|^|), we thus have 

1 f t _! _mlifi-.-sil>iL. c ^ 



M l (m{F)\k\ ] — ( n \ \ (m(F)\k\ ~ ( n \ \ ( a \!A 

\F\J \ { k J 2/ \a) \ [ k )-2 ) \a) \ \k) 2 



□ 



4.3 Forbidding large sets 

Given the previous theorem, all that remains is to show that an extremal family cannot contain sets 
of size n — 1. The idea behind this is as follows. A set of size n — 1 has a very large shadow in the 
k — 1 middle levels. In order for this set to not contain too many fc-chains, we must therefore be 
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missing a lot of sets in the k — 1 middle levels. By Theorem 14,21 it then follows that the remainder 
of the family must contain many more fc-chains than it ought to. The relevant calculations are given 
below. 

Proposition 4.5. Suppose n > 15 and k < n — Q, and let T be a set family with \F\ = M/% + t. If we 
do not have 1 < \F\ < n — 2 for all F G T , then 

> (- 2 *) (" * " T *) C* - X)« + * (( fc 1 J + (») (*)) <* - X>.. 

Proof. The same proof as in Lemma 13.21 shows that we may assume we do not have or [n] in T . 
Hence it suffices to show there are no sets of size n — 1 in our family. 

Suppose towards contradiction we had some set F G T with \F\ = n — 1. We may, by induction, 
assume that F is in at most ((^J + (£)©) ^-chains. 

Since F contains (™Zi) sets of size a — 1, there are ("Zi) (^2) ~~ 2 )! possible fe-chains that F 
might be in which consist of k — 1 sets from the k — 1 middle levels followed by F. Hence we must be 
missing a lot of sets from the k — 1 middle levels to prevent F from being in too many A;-chains. The 
sets of size a — 1 are contained in the most such /c-chains, so if we are missing t<i sets, we must have 



a-17 \k-2r -\\k-l \k \2 



Solving for ti gives 



(2 >i»- i V<V ( °-* +ip - i; 



v o-iy V 2 

We now remove from T all sets of size n — 1, thus losing at most n sets, and apply The- 
orem 14.21 with the above value of ti- In the theorem, the number of /c-chains is governed by 
the expression {t\ + (^p) £2)- We are decreasing t% by at most n, but increasing £2 by at least 
(™Zi) — o fl + IfLzMiK^Lili^ j resulting in a net gain in the previous expression of at least 

¥((::!) -K^^ 1 ^))-"- 

Since n — k > 6, we have n — a > 3. If n — a is some constant, then the first term is at least cubic 
in n, while the term we are subtracting is quadratic, since in this case a — k + 1 will also be constant. 
One the other hand, if n — a is large, the first term will be at least a large power of n, while the term 
we subtract is at most cubic in n. Given n > 15, some simple but tedious calculations show that in 
either case, having a set of size n — 1 increases the number of /c-chains our family must contain. □ 

Note that the condition k < n — 6 is near-optimal, since if k = n — 3, then = 2 n — 2n — 2, and 
so by volume considerations alone there must be extremal families with sets of size at least n — 1. 

Theorem 11.41 now follows easily. 

Proof of Theorem \1.4\ Since n > 15 and k < n — 6, Proposition 14.51 shows that 1 < \F\ < n — 2 for 
all F G T . We may then apply Theorem 14.21 with t\ = t and = to obtain the bound 
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Recalling that a = f 2 ^] , this is precisely the desired lower bound. 

We can again deduce a characterization of the extremal families. Note that we must have t<i = 
for the above bound to hold, and so the k — 1 middle levels must be full. In order to have equality 
in Lemma 14.41 we a l so needed m(F) = a for all F G P. If a = ^y^, then the remaining ( n ) + 1 sets 
must have size § ± § • To obtain equality in Q , every chain must pass through either k or k + 1 sets 
of F, and so we must have {G ^ F : \G\ = § ± |} forming an antichain. 

If, on the other hand, a = , then sets of size a = ¥■ + carry greater weight than sets of 

size a — k = ^ — We can then redefine ti above to be the number of sets missing in the k middle 
levels and obtain the same result. Hence it follows that we must have all sets in the k middle levels, 
with the remaining sets in P of size 2 ± ^±1 . In order to maintain equality in ([3]) , every chain must 
pass through at most one set F G F with |F| = § ± ^±1, and so {F G F : |F| = § ± ^±1} must be 
an antichain. 

Thus, once again, the extremal families are exactly the same as those that minimize the number 
of 2-chains, as given by Theorem ll.2i □ 



5 Concluding remarks and open problems 

In this paper, we have partially answered Kleitman's conjecture by showing that the families that 
minimize the number of 2-chains also minimize the number of /c-chains when they occupy up to the 
k + 1 middle levels. While we strongly believe the conjecture is true in general, we suspect new ideas 
are needed to deal with larger families. As the number of levels grows with respect to k, the number 
of different types of chains - in terms of the sizes of the steps between sets - grows rapidly, and these 
would all need to be controlled to obtain a precise result. In this direction, though, the same methods 
we have used above can be applied to show the following: if we have integers a±, CK2> ■ ■ ■ , Q-k-i with 
Y^i on = l — 1, then, provided \F\ < Mi and the largest set in our family has size at most n — maxj a,, 
the number of /c-chains F\ C i*2 C . . . C F^ with (i^+i \F{\ > a» is minimized by taking sets in the 
middle t levels. 

Considering the case of 2-chains, our paper has focused on showing that a family with more than 
(|n/2j) se ^ s mus t contain many 2-chains. A closely related problem is to determine whether such a 
family must have any sets contained in many 2-chains. This type of question has been studied before 
in other settings. For example, when one is considering the number of triangles in a graph, Erdos 

2 

showed in [5] that any graph with 2_ +1 edges must contain an edge in at least | + o(n) triangles. It 
is well-known and easy to see that the hypercube, a graph whose vertices are subsets of [n], with two 
vertices adjacent if they are comparable and differ in exactly one element, has independence number 
2 71 ~ 1 . Chung, Fiiredi, Graham and Seymour [1] proved any induced subgraph on 2 ra_1 + 1 vertices 
contains a vertex of degree at least + o(l)) log 2 n. It is an open problem to determine whether or 
not this bound is tight (the corresponding upper bound is 0(^/n)), and the answer to this question 
has ramifications in theoretical computer science. 

In the context of Sperner's theorem the above problem has a negative answer, which may be 
surprising given the previous two examples. For convenience, let us assume n = 2m + 1 is odd, and 
consider the following set family. Let 

JT = {F : 1 ^ F, \F\ = m} U {F : 1 G F, \F\ = m + 1}. 
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This family contains 2( 2 ™) = (l + ±) ( Ln %) sets, and so we are indeed beyond the Sperner bound. 
However, it is easy to see that the only pairs of comparable sets are of the form {F, {1} U F} for 
every F G T with \F\ = m. Hence each set of the family is in only one pair of comparable sets. 
In fact, for this family we have C2(J-) = C2(n, (J 7 )), so it is possible to have an extremal family with 
the comparable pairs distributed as evenly as possible. Theorem 11.21 shows that any family with 
2( 2 ™) + 1 sets must contain a set in at least two 2-chains. It is an open problem as to whether this is 
also the largest family without a set that contains two other sets (and hence is the maximum set in 
two 2-chains). This configuration is known as a 2-fork, and the upper bound, which can be obtained 
using similar arguments as in Theorem 11.31 is (l + \ ) ([„/2j)' as shown by Katona and Tarjan [8]. 

We find most exciting the prospect of studying Erdds- Rademacher-type problems in other settings. 
Within the context of Sperner's Theorem, a paper of Qian, Engel and Xu [13] studied an extension for 
multiset families, where the same set may be chosen multiple times. In a series of two papers, Mubayi 
[HI [12] extended the Erdds-Rademacher results to graphs other than cliques, studying the question 
for color-critical graphs and some 3- and 4-uniform hypergraphs. In a subsequent paper, we will 
present Erdds-Rademacher type strengthening of the Erdos-Ko-Rado Theorem. However, as one can 
investigate similar extensions for any extremal result, there is truly no end to the number of directions 
in which this project can be continued. We hope that further work of this nature will lead to many 
interesting results and a greater understanding of classical theorems in extremal combinatorics. 

Note added in proof: During the preparation of this manuscript, it came to our attention that 
Dove, Griggs, Kang and Sereni [2] have independently obtained Theorem 11.31 

References 

[1] F. R. K. Chung, Z. Fiiredi, R. L. Graham and P. Seymour, On induced subgraphs of the cube, 
J. Comb. Theory Ser. A 49 (1988), 180 - 187. 

[2] A. P. Dove, J. R. Griggs, R. J. Kang and J. S. Sereni, Supersaturation in the Boolean lattice, 
arxiv.org: 1303.4336 (2013). 

[3] K. Engel, Sperner Theory, Cambridge University Press, 1997. 

[4] P. Erdds, On a lemma of Littlewood and Offord, Bulletin of the American Mathematical Society 
51 (1945), 898 - 902. 

[5] P. Erdds, On a theorem of Rademacher-Turan, Illinois Journal of Math 6 (1962), 122 - 127. 

[6] P. Erdds, On the number of complete subgraphs contained in certain graphs, Magy. Tud. Acad. 
Mat. Kut. Int. Kdzl. 7 (1962), 459 - 474. 

[7] P. Erdds and D. Kleitman, Extremal problems among subsets of a set, Discrete Math. 8 (1974), 
281 - 294. 

[8] G. O. H. Katona and T. G. Tarjan, Extremal problems with excluded subgraphs in the n-cube, 
Graph Theory, Springer Berlin Heidelberg (1983), 84 - 93. 



16 



[9] D. Kleitman, A conjecture of Erdos-Katona on commensurable pairs among subsets of an n-set, 
Theory of Graphs, Proc. Colloq., Tihany (1966), 215 - 218. 

[10] W. Mantel, Problem 28, Winkundige Opgaven 10 (1907), 60 - 61. 

[11] D. Mubayi, Counting substructures I: color critical graphs, Advances in Mathematics 225 (2010), 
2731 - 2740. 

[12] D. Mubayi, Counting substructures II: hypergraphs, Combinatorica, to appear. 

[13] J. Qian, K. Engel and W. Xu, A generalization of Sperner's theorem and an application to graph 
orientations, Discrete Applied Mathematics 157 (2009), 2170 - 2176. 

[14] E. Sperner, Ein Satz iiber Untermengen einer endlichen Menge, Mathematische Zeitschrift 27 
(1928), 544 - 548. 

A The shifting proposition 

In this appendix, we prove Proposition 12.11 which enables us to perform the shifting necessary for 
Theorem 11.21 As mentioned in Section [21 this is essentially the same shifting argument used in the 
original proof of Kleitman in [9] . We provide the proof here as the details of the calculations are not 
included in Kleitman's paper. 

Proof of Proposition ^. 1\ Suppose not. Note that we must have m > 1, otherwise there is nothing to 
prove. Let £ < 2m — 1 be the minimal integer such that there exists a largest set of size ^ + m with a 
subset of size ^ + m — i that is not in the family. Let A = {A G T : \A\ = | + m, d £ A (£_ J 7 }, and let 
B = d e A \ T . We can construct an auxiliary bipartite inclusion graph on A U B, with an edge (^4, B) 
iff B C A. 

Consider first the case where we have a matching M : A — > B, so that for every set A G A there 
exists a set M{A) C A, M{A) ^ T . We shift the family from J 7 to J 7 by replacing each set A £ A 
by M{A) G B, and claim that this reduces the number of 2-chains. Note that if B = M{A) is a 
newly-introduced set, and C G J- is a set with C C B, then we must have had C C A as well. Thus 
the only 2-chains that we need to consider are those between the levels ^ + m and ^ + m — £; we call 
these intermediate chains. 

Suppose £ > 1. By the minimality of our choice of £, we must have d % A C T for every 1 < i < £— 1. 
Thus the number of intermediate chains in T that we lose is at least |.4| Y2i=i (^ m )- On the other 
hand, all sets of size | + m in J are the sets from J- with i-shadow completely in J- for all i < £. 
These sets cannot be involved in any 2-chains with sets in B, and therefore we only gain intermediate 
chains between the levels ^ + m — 1 and ^ + m — £. The number of such chains that we gain is at 
most \A\ Y^iZl {^~7 +e )- Since £ < 2m - 1, it follows that (^~™ + ^ < {^ m ) for every 1 < i < £ - 1, 
and hence the number of 2-chains decreases. 

Thus we may assume I = 1. If we had A G A and B £ T with B C A, \B\ = ^ + m — 1, then 
upon shifting to J 7 , we lose the 2-chain B C A and gain no pairs. Hence we may assume dA n J- = 0, 
so B = dA. We now claim that the sets A G A cannot be involved in any 2-chains C C A in T . 
Suppose to the contrary we had such a 2-chain. Let x G C be an arbitrary element of C, and shift A 
to A \ {x} (recall that A \ {x} J 7 ). Shift the remaining sets in A by an arbitrary matching from 
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A! = A\ {A} to B' = dA \ {A \ {x}}; we can do this by Hall's Theorem, since every set in A' has at 
least ^ + m — 1 neighbors in B', while each set in B' has at most ^ — m + 1 neighbors. In this shifted 
set we have lost the 2-chain C C A, and hence J- has fewer 2-chains. 

Hence we may assume that there are no 2-chains in T involving sets in A. Thus in the shifted 
family sets in dA will also not be in any 2-chains. Now, since \J-\ > (^„™ 2 j)' ^ follows from 
Sperner's Theorem that there is some 2-chain C C D in T . In T , we may also shift D to some set 
in dA, since m > 1 implies \dA\ > \A\. As no set in dA is involved in any 2-chain, this reduces the 
number of 2-chains, which contradicts the minimality of J-. 

Therefore we conclude that there cannot be a matching from A to B in the auxiliary bipartite 
inclusion graph, and so we will not shift all sets in A. Instead, we use the following lemma, to be 
proven shortly, to find a collection of sets to shift. 

Lemma A.l. Let G be a bipartite graph on UL)V with minimum degree 5u > 1 in U and maximum 
degree Ay in V . Suppose there is no matching from U to V . Then there exist nonempty subsets 
Ui C U and V\ C V with a perfect matching M : TJ\ ->■ V\ and e(Ui,V) + e(U \ Ui,Vi) < \Ui\A v . 

Our auxiliary graph satisfies the conditions of the lemma, with U = A, V = B, Ay = (* ™ + ^) < 
(^ m ) > an d so we can find a collection of sets Ai C A and a matching M : Ai — > B\ C B as given by 
the lemma. Consider the shifted family T where we replace the sets in A\ by the corresponding sets 
in B\. As before, since for every A G A\ we have M(A) C A, we need only consider the intermediate 
chains. 

Again, by the minimality of £, we know that A\ has full shadow in T up until the £ih shadow, and 
so the same calculation as before implies that we remove more chains than we gain, and thus have 
fewer intermediate chains in J 7 . Hence it suffices to consider only the new chains formed between 
levels 7^ + m and ^ + m — t. 

The number of new chains between these levels we gain is exactly e(A \ Ai,B%). On the other 
hand, we lose all chains between A\ and d^AidJ- = d^A\\B. Thus the number of chains we are losing 
is |.4i|(^ m ) ~e{Ai,B). By the lemma, we have |-4i|(^ m ) ~e{Ai,B) > -e(Ai,B) > 

e(A \ Ai, B\), and hence T has fewer 2-chains than J 7 , contradicting the optimality of J- ' . 

Thus if J- minimizes the number of 2-chains, and A is the largest set in J- with \A\ = + m, then 
whenever B C A with \B\ > Q — m + 1, we must have B G T as well. 

□ 

It remains to furnish a proof of Lemma IA.ll which we now provide. 

Proof of Lemma \A.1[ As there is no matching from U to V, by Hall's Theorem there exists a minimal 
subset Uq C U with \N(Uo)\ < \Uq\. Since 5u > 1, we must have |{7o| > 2. Let u £ Uq be an arbitrary 
element, and take U\ = Uo \ {u}. By the minimality of Uq, it follows that |./V(i7i)| > \U\\, and so we 
must have \N(Ui)\ = \Ui\. Set V\ = N(Ui). Again by the minimality of Uo, for any subset X C U\, 
|A^(X)| > \X\, and so by Hall's Theorem there exists a perfect matching M : U\ — >■ V\. 

Now e(U u V) + e(U\U 1 ,V 1 ) = e(U u V) + e(U, V{) - e{U x ,V x ) = e^V^ < \Vi\A v = |C/i|A y , 
where the second equality follows from the fact that N(Ux) = V\, and thus we have the desired 
inequality. □ 
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